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Description 

[0001 ] The present invention provides the use of a fusion protein comprising a site-specific DN A recombinase domain 
and a protein transduction domain for preparing an agent for inducing target gene alteration in a living organism, 
5 suitable fusion proteins and a method for the production of said fusion proteins. 

Background 

[0002] For some years targeted mutagenesis in totipotent mouse embryonic stem (ES) cells has been used to inac- 

10 tivate genes, for which cloned sequences were available (Capecchi Trends in Genetics 5, 70 - 76 (1989)). Since ES 
cells can pass mutations induced in vitro to transgenic offspring in vivo, it is possible to analyze the consequences of 
gene disruption in the context of the entire organism. Thus, numerous mouse strains with functionally inactivated genes 
("knock out mice") have been created by this technology and utilized to study the biological function of a variety of genes. 
[0003] A refined method of targeted mutagenesis, referred to as conditional mutagenesis, employs a site-specific 

15 recombination system (e.g. Cre/loxP or Flp/frt - Sauer and Henderson, N. Proc. Natl. Acad. Sci. USA 85, 5166-5170 
(1988); Senecoff et al., J. Mol. Biol., 201, 405 - 421 (1988)) which enables a temporally and/or spatially restricted 
alteration of target genes (Rajewsky et al, J. Clin. Invest., 98, 600 - 603 (1996)). The creation of conditional mouse 
mutants requires the generation of two mouse strains, i.e. the recombinase recognition strain and the recombinase 
expressing strain. The recombinase recognition strain is generated by homologous recombination in ES cells as de- 

20 scribed above except that the targeted exon(s) is (are) flanked by two recombinase recognition sequences (hereinafter 
"RRS"; e.g. loxP or frt). The type of recombination event mediated by the recombinase depends on the disposition of 
the RRS, with deletions, inversions, translocations and integrations being possible (Torres and Kuhn, Oxford Univerity 
Press, Oxford, New York (1 997)). By placing the RRS into introns, an interference with gene expression before recom- 
bination can be avoided. The recombinase expressing strain contains a recombinase transgene (e.g. Cre, Flp) whose 

25 expression is either restricted to certain cells and tissues or is inducible by external agents. Crossing of the recombinase 
recognition strain with the recombinase expressing strain recombines the RRS-flanked exons from the doubly trans- 
genic offspring in a prespecified temporally and/or spatially restricted manner. Thus, the method allows the temporal 
analysis of gene function in particular cells and tissues of otherwise widely expressed genes. Moreover, it enables the 
analysis of gene function in the adult organism by circumventing embryonic lethality which is frequently the conse- 

30 quence of gene mutation. For pharmaceutical research, aiming to validate the utility of genes and their products as 
targets for drug development, inducible mutations provide an excellent genetic tool. However, the current systems for 
inducible recombinase expression in transgenic animals suffer from a certain degree of leakyness in the absence of 
the inducer (Kuhn et al., Science 269(5229): 1 427-9 (1 995); Schwenk et al., Nucleic Acids Res; 26(6):1 427-32 (1 998)). 
Furthermore, the generation of conditional mutants is a time consuming and labor intensive procedure, since the re- 

35 combinase recognition strain and the recombinase expressing strain have to be breed at least over two generations 
in order to obtain animals carrying both, the recombinase transgene and two copies of the RRS-flanked target gene 
sequence. 

[0004] Protein domains that have the ability to cross cell membranes were identified in the Antennapedia protein 
from Drosophila (Vives et al., J Biol Chem, 272(25):16010-7 (1997)), VP22 from HSV (Elliott and O'Hare, Cell, 88(2): 

40 223-33 (1997)) and TAT from HIV (Green and Loewenstein, Cell, 55(6): 11 79-88 (1988); Frankel and Pabo, Cell, 55(6): 
11 89-93 (1 988)). Fusion of such domains to heterologous proteins conferred the ability to transduce into cultured cells 
(Fawell et al., Proc Natl Acad Sci USA, 91(2):664-8 (1994); Elliott and O'Hare (1997), Phelan et al., Nature Biotech. 
1 6; 440-443 (1 998) and Dilber et al., Gene Ther., 6(1 ):1 2-21 (1 999)). Dalby and Bennett showed that a fusion protein 
consisting of VP22 and functional Flp recombinase translocated between cells in culture (from COS-1 cells transfected 

^5 with VP22-Flp to CHO cells carrying Flp recognition sites (FRT sites); see Dalby and Bennett, Invitrogen, Expressions 
6.2, page 13(1999)). 

[0005] On the other hand, a recent report demonstrated that the B-galactosidase protein fused to the 1 1 amino acids 
transduction domain from the HIV TAT protein can infiltrate all tissues of living mice reaching every single cell (Schwarze 
et al., Science, 285(5433): 1569-72 (1999)). 
so [0006] It was found that site-specific DNA recombinases can be translocated into cells of a living organism when 
fused to a protein transduction domain. Thus, whenever a gene mutation is desired, recombination is induced upon 
the injection of the appropriate site-specific recombinase fused to a transduction domain into such a living organism 
(provided, however, that said organism carries at least one appropriate RRS integrated in the genome). 
[0007] The present invention thus provides 

55 

(1) the use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 
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(b) a protein transduction domain 

for preparing an agent for inducing target gene alterations in a living organism, wherein said living organism 
carries at least one or more recognition sites for said site-specific DNA recombinase integrated in its genome; 

(2) a method for inducing gene alterations in a living organism which comprises administering to said living organism 
a fusion protein comprising a site-specific DNA recombinase domain and a protein transduction domain as defined 
in (1) above, wherein said living organism carries at least one or more recognition sites for said site-specific DNA 
recombinase integrated in its genome; 

(3) a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain 

provided that when the site-specific DNA recombinase domain is wild type Flp then the protein transduction 
domain is not the VP22 protein of HSV (i.e., the fusion protein is not identical to the fusion protein of Dalby 
and Bennett (1999)); 

(4) a DNA sequence coding for the fusion protein of (3) above; 

(5) a vector comprising the DNA sequence as defined in (4) above; 

(6) a host cell transformed with the vector of (5) above and/or comprising the DNA of (4) above; 

(7) a method for producing the fusion protein of (1) above which comprises culturing the transformed host cell of 
(6) above and isolating the fusion protein; and 

(8) an injectable composition comprising the fusion protein as defined in (1) or (3) above. 

[0008] The invention is further illustrated by the appended Figure and is explained in detail below. 
[0009] Figure 1 : Generation of induced mouse mutants using purified fusion proteins. 

A: Expression of the fusion protein consisting of the site-specific DNA recombinase (e.g. Cre) and the protein trans- 
duction domain (e.g. the HIV derived TAT peptide) in prokaryontic or eukaryontic cells. 
B: Extraction and purification of the expressed fusion protein (e.g. as described in Nagahara et al., 1998). 
C: Injection of the purified fusion protein into mice carrying the RRS-flanked target sequence. 
D: Analysis of the pattern of induced target gene recombination and the resulting phenotype. 
Triangle: RRS. 

[0010] The expression "target sequences" according to the present invention means all kind of sequences which 
may be mutated (viz. deleted, translocated, integrated and/or inverted) by the action of the recombinase. The number 
of RRS in the target sequence depends on the kind of mutation to be performed by the recombinase. For most of the 
mutations (especially for deletions and invertions) two RRS are required which are flanking the sequence to be mutated 
(deleted or inverted). For some kinds of integrations only one RRS may be necessary within the target sequence. 
[0011] The "living organisms" according to the present invention are multi-cell organisms and can be vertebrates 
such as mammals (e.g., rodents such as mice or rats) or non-mammals (e.g., fish) or can be invertebrates such as 
insects or worms. Most preferred living organisms are mice and fish. 

[001 2] The site-specific DNA recombinase domain within the fusion protein of the invention of the present application 
is preferably selected from a recombinase protein derived from Cre, Flp, <J>C31 recombinase (Thorpe and Smith, Proc. 
Natl. Acad. Sci, USA, vol. 95, 5505-5510 (1998)) and R recombinase (Araki et al., J. Mol. Biol., 182, 191-203 (1985)). 
The preferred recombinases are Cre (e.g., the Cre variant of aa 15 to 357 of SEQ ID NO: 2 or aa 325-667 of SEQ ID 
NO: 6) and Flpe (i.e., the Flp variant of aa 15 to 437 of SEQ ID NO: 4 or aa 325 to 747 of SEQ ID NO: 8). 
[0013] The protein transduction domain according to the present invention is preferably derived from the Antenna- 
pedia protein of Drosophila, from the VP22 protein of HSV or from the TAT protein of HIV. Preferrably the protein 
transduction domain is derived from the TAT protein among which a TAT protein comprising the amino acid sequence 
shown in SEQ ID NO: 10 is most preferred. 

[0014] The fusion of the two domains of the fusion protein can occur at any possible position, i.e., the protein trans- 
duction domain can be fused to the N- or C-terminal of the site-specific DNA recombinase or can be fused to active 
sites within the site-specific DNA recombinase. Preferrably the protein transfusion domain is fused to the N-terminal 
of the site-specific DNA recombinase domain. 

[0015] The protein transduction domain can be fused to the site-specific DNA recombinase either through a direct 
chemical bond or through a linker molecule. Such linker molecule can be any bivalent chemical structure capable of 
linking the two domains. The preferred linker molecule according to the present invention is a short peptide, e.g., having 
1 to 20, preferrably 1 to 1 0, amino acid residues. Specifically preferred short peptides are essentially consisting of Gly, 
Ala and/or Leu. 

[001 6] The fusion protein of the invention of the present application may further comprise other functional sequences 
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such as secretion conferring signals, nuclear localization signals and/or signals conferring protein stabilization. 
[0017] In case the fusion protein comprises a protein transduction domain derived from the TAT protein of HIV, the 
DNA sequence coding for said fusion protein preferrably comprises the sequence 

5' TAC GGC CGC AAG AAG CGC CGC CAA CGC CGC CGC 3'. 



[0018] Such a preferred DNA sequence is for instance shown in SEQ ID NO: 11 . In said sequence the 3' terminal 
io codon ggc codes for the linker Gly. The DNA sequence of a suitable recombinase may be directly attached to said 
codon ggc. 

[0019] The fusion protein can be obtained by the following steps: 

1 . Fusion of the recombinase coding region (e.g. encoding Cre: see amino acids 15 to 357 of SEQ ID NO: 2) with 
15 the sequence conferring protein translocation (e.g. the sequence encoding the TAT peptide YGRKKRRQRRR, 

SEQ ID NO: 1 0) using standard cloning protocols (Maniatis et al., Cold Spring Harbor Laboratory, New York (1 989)) 
or chemical synthesis. 

2. Generation of a construct for the expression of the fusion protein in prokaryotic or eukaryotic cells, e.g. in E. 
coli DH5a (Hanahan, J. Mol. Biol.;166(4):557-80 (1983)) using the QIAexpress pQE vector (Qiagen, Hilden). 

20 3. Expression of the above mentioned fusion protein in prokaryotic or eukaryotic cells, e.g. in E. coli DH5a (Hana- 

han, 1983) 

4. Extraction and purification of the above mentioned fusion protein e.g. as described in Nagahara et al., Nat. Med., 
4(1 2):1 449-52 (1998). 

25 [0020] Injection of the purified fusion protein into a living organism (e.g., a mouse) carrying a gene comprising the 
RRS-flanked target sequence (e.g., in an amount of 5 to 50 ng per g body weight). To demonstrate the feasibility of 
the invention, a reporter mouse strain is used harbouring a RRS-flanked flanked cassette which, when deleted by the 
recombinase, allows the expression of a cellular marker protein, such as (3-galacosidase (Thorey et al., Mol. Cell Biol., 
18(10):6164(1998)). 

30 [0021] Analysis is achieved by determining the pattern of induced target gene recombination (e.g. through Southern 
blot analysis and X-Gal staining on tissue sections; Maniatis et al., 1 989; Gossler and Zachgo, Joyner AL (Ed.), Oxford 
Univerity Press, Oxford, New York (1993)). 

[0022] The procedure's advantages over current technology are as follows: 

35 (j) The absence of background recombination before administration of the fusion protein. 

(ii) The reduction of time and resources which are necessary to combine the recombinase transgene and two 
copies of the RRS-flanked target gene by conventional breeding. 



4 



EP 1 118 668 A1 



SEQUENCE LISTING 

<110> ARTEMIS Pharmaceuticals GmbH 



10 



<120> Transduction of recombinases for inducible 
gene targeting 

<130> 992928ep/JH/ml 

<140> 
<141> 



15 



<160> 12 

<170> Patentln Ver. 2.1 



20 



25 



30 



35 



<210> 1 
<211> 1074 
<212> DNA 

<213> Artificial Sequence 
<220> 



<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 



<220> 
<221> CDS 
<222> (1). 



(1071) 



<400> 1 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tec 48 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
1 5 10 15 

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96 

Asn Leu Leu Thr Val His Gin Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 



40 



acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 144 
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tec gtt tgc egg 192 
Gin Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 



45 



50 



55 



teg tgg gcg gca tgg tgc aag ttg aat aac egg aaa tgg ttt ccc gca 
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 



240 



gaa cct gaa gat gtt cgc gat tat ctt eta tat ctt cag gcg cgc ggt 288 
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gin Ala Arg Gly 
85 90 * 95 



ctg gca gta aaa act ate cag caa cat ttg ggc cag eta aac atg ctt 
Leu Ala Val Lys Thr lie Gin Gin His Leu Gly Gin Leu Asn Met Leu 
100 105 110 



336 
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cat cgt egg tec ggg ctg cca cga cca agt gac age aat get gtt tea 384 

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 

115 120 125 

ctg gtt atg egg egg ate cga aaa gaa aac gtt gat gee ggt gaa cgt 432 

Leu Val Met Arg Arg lie Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

gca aaa cag get eta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 480 

w Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gin Val Arg 

145 150 155 160 

tea etc atg gaa aat age gat cgc tgc cag gat ata cgt aat ctg gca 528 

Ser Leu Met Glu Asn Ser Asp Arg Cys Gin Asp lie Arg Asn Leu Ala 

j5 165 170 175 

ttt ctg ggg att get tat aac ace ctg tta cgt ata gee gaa att gec 576 

Phe Leu Gly lie Ala Tyr Asn Thr Leu Leu Arg lie Ala Glu lie Ala 
180 185 190 

20 agg ate agg gtt aaa gat ate tea cgt act gac ggt ggg aga atg tta 624 

Arg lie Arg Val Lys Asp lie Ser Arg Thr Asp Gly Gly Arg Met Leu 

195 200 205 



25 



30 



35 



ate cat att ggc aga acg aaa acg ctg gtt age acc gca ggt gta gag 672 
He His He Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

aag gca ctt age ctg ggg gta act aaa ctg gtc gag cga tgg att tec 720 
Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp He Ser 
225 230 235 240 

gtc tct ggt gta get gat gat ccg aat aac tac ctg ttt tgc egg gtc 768 
Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

aga aaa aat ggt gtt gec gcg cca tct gee acc age cag eta tea act 816 
Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gin Leu Ser Thr 
260 265 270 



cgc gec ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864 

Arg Ala Leu Glu Gly He Phe Glu Ala Thr His Arg Leu He Tyr Gly 
40 275 280 285 

get aag gat gac tct ggt cag aga tac ctg gee tgg tct gga cac agt 912 

Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu Ala Trp Ser Gly His Ser 

290 295 300 

45 gec cgt gtc gga gee gcg cga gat atg gee cgc get gga gtt tea ata 960 

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser He 
305 310 315 320 

ccg gag ate atg caa get ggt ggc tgg acc aat gta aat att gtc atg 1008 

so Pro Glu He Met Gin Ala Gly Gly Trp Thr Asn Val Asn He Val Met 

325 330 335 

aac tat ate cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056 

Asn Tyr He Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 

340 345 350 

55 
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ctg gaa gat ggc gat tag 1074 
Leu Glu Asp Gly Asp 
355 



<210> 2 
<211> 357 
<212> PRT 

<213> Artificial Sequence 
10 <223> Description of Artificial Sequence: DNA sequence 

coding for a fusion protein TAT-Cre 

<400> 2 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
1 5 10 15 

15 

Asn Leu Leu Thr Val His Gin Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
20 35 40 45 

Gin Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 



25 



Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gin Ala Arg Gly 
85 90 95 

30 Leu Ala Val Lys Thr lie Gin Gin His Leu Gly Gin Leu Asn Met Leu 

100 105 110 

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

35 Leu Val Met Arg Arg He Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 

130 135 140 

Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gin Val Arg 
145 150 155 160 

Ser Leu Met Glu Asn Ser Asp Arg Cys Gin Asp He Arg Asn Leu Ala 
165 170 175 

Phe Leu Gly He Ala Tyr Asn Thr Leu Leu Arg He Ala Glu He Ala 
180 185 190 

Arg He Arg Val Lys Asp He Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

He His He Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
50 210 215 220 

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp He Ser 
225 230 235 240 



40 



45 



55 



Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 
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10 



15 



Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gin Leu Ser Thr 
260 265 270 

Arg Ala Leu Glu Gly lie Phe Glu Ala Thr His Arg Leu lie Tyr Gly 
275 280 285 

Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser lie 
305 310 315 320 

Pro Glu lie Met Gin Ala Gly Gly Trp Thr Asn Val Asn lie Val Met 
325 330 335 

Asn Tyr He Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

Leu Glu Asp Gly Asp 
20 355 

<210> 3 
<211> 1317 
<212> DNA 
25 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

30 

<220> 

<221> CDS 

<222> (1) . . (1311) 

<400> 3 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg agt 48 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
1 5 10 ^ 15 

caa ttt gat ata tta tgt aaa aca cca cct aag gtc ctg gtt cgt cag 96 
Gin Phe Asp He Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gin 
20 25 30 

ttt gtg gaa agg ttt gaa aga cct tea ggg gaa aaa ata gca tea tgt 144 
Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys He Ala Ser Cys 
35 40 45 

get get gaa eta acc tat tta tgt tgg atg att act cat aac gga aca 192 
Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met He Thr His Asn Gly Thr 
50 55 60 

50 gca ate aag aga gee aca ttc atg age tat aat act ate ata age aat 240 

Ala He Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr He He Ser Asn 
65 70 75 80 

teg ctg agt ttc gat att gtc aac aaa tea etc cag ttt aaa tac aag 288 
Ser Leu Ser Phe Asp He Val Asn Lys Ser Leu Gin Phe Lys Tyr Lys 
55 85 90 95 



35 



40 



45 
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10 



15 



25 



ttg act aat gtt gtg gga aat tgg age gat aag cgt get tct gee gtg 1056 
Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val 
340 345 * 350 

gee agg aca acg tat act cat cag ata aca gca ata cct gat cac tac 1104 
Ala Arg Thr Thr. Tyr Thr His Gin lie Thr Ala He Pro Asp His Tyr 
355 360 365 



ttc gca eta gtt tct egg tac tat gca tat gat cca ata tea aag gaa 1152 

Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro He Ser Lys Glu 

370 375 380 

atg ata gca ttg aag gat gag act aat cca att gag gag tgg cag cat 1200 

Met He Ala Leu Lys Asp Glu Thr Asn Pro He Glu Glu Trp Gin His 

385 390 395 400 

ata gaa cag eta aag ggt agt get gaa gga age ata cga tac ccc gca 1248 

20 He Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser He Arg Tyr Pro Ala 

405 410 415 

tgg aat ggg ata ata tea cag gag gta eta gac tac ctt tea tec tac 1296 

Trp Asn Gly He He Ser Gin Glu Val Leu Asp Tyr Leu Ser Ser Tyr 

420 425 * 430 



ata aat aga cgc ata taatga 1317 
He Asn Arg Arg lie 
435 



30 

<210> 4 
<211> 437 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
35 coding for a fusion protein TAT-Flpe 

<400> 4 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
15 10 15 

40 

Gin Phe Asp He Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gin 
20 25 30 

Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys He Ala Ser Cys 
35 40 45 

Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met He Thr His Asn Gly Thr 
50 55 60 

Ala He Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr He He Ser Asn 
65 70 75 80 

Ser Leu Ser Phe Asp lie Val Asn Lys Ser Leu Gin Phe Lys Tyr Lys 
85 90 " 95 

Thr Gin Lys Ala Thr lie Leu Glu Ala Ser Leu Lys Lys Leu lie Pro 
55 100 105 110 



45 



50 
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Ala Trp Glu 
115 



Phe Thr lie lie Pro Tyr 
120 



Asn Gly Gin Lys 
125 



His Gin Ser 



10 



Asp lie Thr 
130 

Glu Glu Ala 
145 

Leu Leu Ser 



Asp lie 
Asp Lys 



Glu Gly 
165 



Val Ser Ser Leu 
135 

Gly Asn Ser His 
150 

Glu Ser lie Trp 



Gin Leu Gin Phe 
140 

Ser Lys Lys Met 
155 

Glu lie Thr Glu 
170 



Glu Ser Ser 



Leu Lys Ala 
160 

Lys lie Leu 
175 



15 



20 



Asn Ser Phe 



Gin Phe Leu 
195 

lie Lys Asn 
210 

Leu Gly Val 
225 



Glu Tyr Thr Ser Arg Phe 
180 * 185 

Phe Leu Ala Thr Phe lie 
200 

Val Asp Pro Lys Ser Phe 
215 

lie He Gin Cys Leu Val 
230 



Thr Lys Thr Lys 



Asn Cys Gly Arg 
205 

Lys Leu Val Gin 
220 

Thr Glu Thr Lys 
235 



Thr Leu Tyr 
190 

Phe Ser Asp 
Asn Lys Tyr 



Thr Ser Val 
240 



25 



Ser Arg His 
Val Tyr Leu 



lie Tyr 
245 

Asp Glu 
260 



Phe Phe Ser Ala 



Phe Leu Arg Asn 
265 



Arg Gly Arg He 
250 

Ser Glu Pro Val 



Asp Pro Leu 
255 

Leu Lys Arg 
270 



30 



35 



40 



Val Asn Arg 
275 

Leu Lys Asp 
290 

Ala Pro Tyr 
305 

Gly Arg His 



Leu Thr Asn 



Thr Gly Asn Ser Ser Ser 
280 

Asn Leu Val Arg Ser Tyr 
295 

Pro He Phe Ala He Lys 
310 



Leu Met 
325 

Val Val 
340 



Thr Ser Phe Leu 



Gly Asn Trp Ser 
345 



Asn Lys Gin Glu 
285 

Asn Lys Ala Leu 
300 

Asn Gly Pro Lys 
315 

Ser Met Lys Gly 
330 

Asp Lys Arg Ala 



Tyr Gin Leu 
Lys Lys Asn 



Ser His He 
320 

Leu Thr Glu 
335 

Ser Ala Val 
350 



45 



50 



Ala Arg Thr 
355 

Phe Ala Leu 
370 

Met He Ala 

385 

He Glu Gin 



Thr Tyr 
Val Ser 
Leu Lys 



Leu Lys 
405 



Thr His Gin He 
360 

Arg Tyr Tyr Ala 
375 

Asp Glu Thr Asn 
390 

Gly Ser Ala Glu 



Thr Ala He Pro 

365 

Tyr Asp Pro He 
380 

Pro lie Glu Glu 
395 

Gly Ser He Arg 
410 



Asp His Tyr 
Ser Lys Glu 



Trp Gin His 
400 

Tyr Pro Ala 
415 



55 



Trp Asn Gly 



He He Ser Gin Glu Val 
420 425 



Leu Asp Tyr Leu 



Ser Ser Tyr 
430 
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He Asn Arg Arg He 
435 



<210> 5 
<211> 2004 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 



15 <220> 

<221> CDS 

<222> (1) . . (2001) 

<400> 5 

20 atg acc tct cgc cgc tec gtg aag teg ggt ccg egg gag gtt ccg cgc 48 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 

15 10 15 

gat gag tac gag gat ctg tac tac acc ccg tct tea ggt atg gcg agt 96 
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

ccc gat agt ccg cct gac acc tec cgc cgt ggc gec eta cag aca cgc 144 
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 

teg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag teg gat 192 
Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
50 ^ 55 60 

tat gee etc tac ggg ggc teg tct tec gaa gac gac gaa cac ccg gag 240 
35 Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 

65 70 75 80 

gtc ccc egg acg egg cgt ccc gtt tec ggg gcg gtt ttg tec ggc ccg 288 
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
40 85 90 95 

ggg cct gcg egg gcg cct ccg cca ccc get ggg tec gga ggg gec gga 336 
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 



25 



30 



45 



50 



55 



cgc aca ccc acc acc gee ccc egg gec ccc cga acc cag egg gtg gcg 384 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 

115 120 125 

act aag gee ccc gcg gee ccg gcg gcg gag acc acc cgc ggc agg aaa 432 

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 

130 135 140 

teg gec cag cca gaa tec gee gca etc cca gac gec ccc gcg teg acg 480 

Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 
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gcg cca acc cga tec aag aca ccc gcg cag ggg ctg gec aga aag ctg 528 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
165 170 175 

cac ttt age acc gec ccc cca aac ccc gac gcg cca tgg acc ccc egg 576 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

gtg gee ggc ttt aac aag cgc gtc ttc tgc gee gcg gtc ggg cgc ctg 624 

w Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val' Gly Arg Leu 

195 200 205 

gcg gec atg cat gee egg atg gcg gcg gtc cag etc tgg gac atg teg 672 

Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 
210 215 220 

15 

cgt ccg cgc aca gac gaa gac etc aac gaa etc ctt ggc ate acc acc 720 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
225 230 235 240 

ate cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gee aac 768 

lie Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gee acg gcg 816 

Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 

act cga ggg cgt tct gcg gcg teg cgc ccc acc gag cga cct cga gee 864 

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 

275 280 285 

cca gee cgc tec get tct cgc ccc aga egg ccc gtc gag ggt acc gag 912 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

etc gga tec act agt cca gtg tgg tgg aat tct gca gat ate cag cac 960 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp lie Gin His 
305 310 315 320 

agt ggc ggc cgc atg tec aat tta ctg acc gta cac caa aat ttg cct 1008 

40 Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 

325 330 335 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056 

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

45 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104 

Asp Met Phe Arg Asp Arg Gin Ala Phe Ser Glu His Thr Trp Lys Met 

355 360 365 

50 ctt ctg tec gtt tgc egg teg tgg gcg gca tgg tgc aag ttg aat aac 1152 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

egg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt eta 1200 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
55 385 390 395 400 



20 



25 



30 



35 
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tat ctt cag gcg cgc ggt ctg gca gta aaa act ate cag caa cat ttg 1248 
Tyr Leu Gin Ala Arg Gly Leu Ala Val Lys Thr lie Gin Gin His Leu 
405 410 415 

o 

ggc cag eta aac atg ctt cat cgt egg tec ggg ctg cca cga cca agt 1296 
Gly Gin Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 430 

io gac age aat get gtt tea ctg gtt atg egg egg ate cga aaa gaa aac 1344 

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg lie Arg Lys Glu Asn 
435 440 445 



15 



35 



40 



45 



55 



gtt gat gee ggt gaa cgt gca aaa cag get eta gcg ttc gaa cgc act 1392 
Val Asp Ala Gly Glu Arg Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr 
450 455 460 



gat ttc gac cag gtt cgt tea etc atg gaa aat age gat cgc tgc cag 1440 

20 Asp Phe Asp Gin Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gin 

465 470 475 480 

gat ata cgt aat ctg gca ttt ctg ggg att get tat aac ace ctg tta 1488 

Asp He Arg Asn Leu Ala Phe Leu Gly lie Ala Tyr Asn Thr Leu Leu 

485 490 495 

25 

cgt ata gee gaa att gee agg ate agg gtt aaa gat ate tea cgt act 1536 

Arg He Ala Glu He Ala Arg He Arg Val Lys Asp He Ser Arg Thr 

500 505 510 

30 gac ggt ggg aga atg tta ate cat att ggc aga acg aaa acg ctg gtt 1584 

Asp Gly Gly Arg Met Leu He His He Gly Arg Thr Lys Thr Leu Val 

515 520 525 



age acc gca ggt gta gag aag gca ctt age ctg ggg gta act aaa ctg 1632 

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 

gtc gag cga tgg att tec gtc tct ggt gta get gat gat ccg aat aac 1680 

Val Glu Arg Trp He Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 

545 550 555 560 

tac ctg ttt tgc egg gtc aga aaa aat ggt gtt gee gcg cca tct gec 1728 

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 

565 570 575 

acc age cag eta tea act cgc gee ctg gaa ggg att ttt gaa gca act 1776 

Thr Ser Gin Leu Ser Thr Arg Ala Leu Glu Gly lie Phe Glu Ala Thr 

580 585 590 



cat cga ttg att tac ggc get aag gat gac tct ggt cag aga tac ctg 1824 

His Arg Leu He Tyr Gly Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu 

so 595 600 " 605 

gee tgg tct gga cac agt gee cgt gtc gga gee gcg cga gat atg gec 1872 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 



cgc get gga gtt tea ata ccg gag ate atg caa get ggt ggc tgg acc 



1920 
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Arg Ala Gly Val Ser lie Pro Glu He Met Gin Ala Gly Gly Trp Thr 
625 630 635 ' 640 

aat gta aat att gtc atg aac tat ate cgt aac ctg gat agt gaa aca 1968 
5 Asn Val Asn He Val Met Asn Tyr He Arg Asn Leu Asp Ser Glu Thr 

645 650 655 

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat tag 2004 
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp 
w 660 665 



<210> 6 
<211> 667 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 



20 



<400> 6 
Met Thr 
1 



Ser Arg Arg Ser Val 
5 



Lys Ser Gly Pro Arg Glu Val Pro Arg 
10 15 



25 



30 



Asp Glu Tyr Glu Asp Leu Tyr 
20 

Pro Asp Ser Pro Pro Asp Thr 
35 

Ser Arg Gin Arg Gly Glu Val 
50 55 



Tyr Thr Pro Ser Ser Gly Met Ala Ser 
25 30 

Ser Arg Arg Gly Ala Leu Gin Thr Arg 
40 45 

Arg Phe Val Gin Tyr Asp Glu Ser Asp 
60 



35 



Tyr Ala Leu Tyr Gly Gly Ser 
65 70 

Val Pro Arg Thr Arg Arg Pro 
85 



Ser Ser Glu Asp Asp Glu His Pro Glu 
75 80 

Val Ser Gly Ala Val Leu Ser Gly Pro 
90 95 



40 



45 



Gly Pro Ala Arg Ala Pro Pro 
100 

Arg Thr Pro Thr Thr Ala Pro 
115 

Thr Lys Ala Pro Ala Ala Pro 

130 135 

Ser Ala Gin Pro Glu Ser Ala 
145 150 



Pro Pro Ala Gly Ser Gly 
105 

Arg Ala Pro Arg Thr Gin 
120 125 

Ala Ala Glu Thr Thr Arg 
140 

Ala Leu Pro Asp Ala Pro 
155 



Gly Ala Gly 
110 

Arg Val Ala 
Gly Arg Lys 



Ala Ser Thr 
160 



50 



Ala Pro Thr Arg Ser Lys Thr 
165 

His Phe Ser Thr Ala Pro Pro 
180 



Pro Ala Gin Gly Leu Ala Arg Lys Leu 

170 175 

Asn Pro Asp Ala Pro Trp Thr Pro Arg 

185 190 



55 



Val Ala Gly Phe Asn Lys Arg 
195 



Val Phe Cys Ala Ala Val Gly Arg Leu 
200 ^ 205 
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10 



15 



20 



Ala Ala 
210 

Arg Pro 
225 



Met His 



Arg Thr 



He Arg Val Thr 



Glu Leu 



Thr Arg 



Pro Ala 
290 

Leu Gly 
305 



Val Asn 
260 

Gly Arg 
275 

Arg Ser 
Ser Thr 



Ala Arg Met 
215 

Asp Glu Asp 
2 30 

Val Cys Glu 
245 

Pro Asp Val 



Ser Ala Ala 



Ala Ser Arg 
295 

Ser Pro Val 
310 



Ala Ala Val 

Leu Asn Glu 

Gly Lys Asn 
250 

Val Gin Asp 
265 

Ser Arg Pro 
280 

Pro Arg Arg 
Trp Trp Asn 



Gin Leu 
220 

Leu Leu 
235 



Trp Asp Met Ser 



Gly He Thr Thr 
240 



Leu Leu Gin Arg Ala Asn 
255 

Val Asp Ala Ala Thr Ala 
270 

Thr Glu Arg Pro Arg Ala 
285 



Pro Val 
300 

Ser Ala 
315 



Glu Gly Thr Glu 



Asp He Gin His 
320 



Ser Gly Gly Arg 



25 



30 



35 



40 



45 



50 



55 



Ala Leu 

Asp Met 

Leu Leu 
370 

Arg Lys 
385 

Tyr Leu 
Gly Gin 
Asp Ser 



Val Asp 
450 

Asp Phe 
465 



Pro Val 
340 

Phe Arg 
355 



Met Ser Asn Leu 

325 

Asp Ala Thr Ser 



Asp Arg Gin Ala 
360 



Leu Thr Val His Gin Asn Leu Pro 
330 335 

Asp Glu Val Arg Lys Asn Leu Met 
345 350 



Phe Ser Glu His 



Ser Val Cys Arg Ser Trp 
375 



Ala Ala 



Trp Phe Pro Ala Glu Pro Glu Asp 
390 



Gin Ala Arg Gly Leu Ala 
405 



Leu Asn 
420 

Asn Ala 
435 

Ala Gly 
Asp Gin 
Arg Asn 



Met Leu His Arg 



Val Ser Leu Val 
440 

Glu Arg Ala Lys 
455 



Val Lys 
410 

Arg Ser 
425 

Met Arg 
Gin Ala 



Val Arg Ser Leu Met Glu 
470 



Trp Cys 
380 

Val Arg 
395 

Thr He 
Gly Leu 
Arg He 



Leu Ala 
460 

Asn Ser 
475 



Thr Trp Lys Met 
365 

Lys Leu Asn Asn 



Asp Tyr Leu Leu 
400 

Gin Gin His Leu 
415 

Pro Arg Pro Ser 
430 

Arg Lys Glu Asn 
445 

Phe Glu Arg Thr 



Asp Arg Cys Gin 
480 



Leu Ala Phe Leu 
485 



Gly He 
490 



Asp He 
Arg He 

Asp Gly Gly Arg Met Leu He His He Gly 



Asn Thr Leu Leu 
495 



Ala Glu 
500 



He Ala Arg He 



Arg Val 
505 



Ala Tyr 
Lys Asp 
Arg Thr Lys Thr Leu Val 



He Ser Arg Thr 
510 
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515 520 525 

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
5 530 535 540 

Val Glu Arg Trp lie Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
545 550 555 560 

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
10 565 570 575 

Thr Ser Gin Leu Ser Thr Arg Ala Leu Glu Gly He Phe Glu Ala Thr 
580 585 590 



15 



His Arg Leu He Tyr Gly Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu 
595 600 605 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 

20 Arg Ala Gly Val Ser He Pro Glu He Met Gin Ala Gly Gly Trp Thr 

625 630 635 640 

Asn Val Asn He Val Met Asn Tyr He Arg Asn Leu Asp Ser Glu Thr 
645 650 655 



25 



35 



40 



45 



Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp 
660 665 



<210> 7 
30 <211> 2004 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 

<220> 
<221> CDS 
<222> (1) . . (1998) 

<400> 7 

atg acc tct cgc cgc tec gtg aag teg ggt ccg egg gag gtt ccg cgc 48 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 b 10 15 

gat gag tac gag gat ctg tac tac acc ccg tct tea ggt atg gcg agt 96 
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

ccc gat agt ccg cct gac acc tec cgc cgt ggc gee eta cag aca cgc 144 
50 Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 

35 40 45 

teg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag teg gat 192 

Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 

50 55 60 

55 
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tat gcc etc tac ggg ggc teg tct tec gaa gac gac gaa cac ccg gag 240 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 

65 70 75 1 80 

5 

gtc ccc egg acg egg cgt ccc gtt tec ggg gcg gtt ttg tec ggc ccg 288 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 

85 90 95 

ggg cct gcg egg gcg cct ccg cca ccc get ggg tec gga ggg gcc gga 336 

10 Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 

100 105 110 

cgc aca ccc ace ace gcc ccc egg gcc ccc cga acc cag egg gtg gcg 384 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 

15 115 120 125 

act aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432 

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 

130 135 140 

20 teg gcc cag cca gaa tec gcc gca etc cca gac gcc ccc gcg teg acg 4 80 

Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 

145 150 155 160 . 



25 



30 



35 



gcg cca acc cga tec aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 

165 170 175 

cac ttt age acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc egg 576 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 

180 185 190 

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 

195 200 205 

gcg gcc atg cat gcc egg atg gcg gcg gtc cag etc tgg gac atg teg 672 

Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 

210 215 220 



cgt ccg cgc aca gac gaa gac etc aac gaa etc ctt ggc ate acc acc 720 
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
40 2 2 5 2 3 0 2 3 5 2 4 0 

ate cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 768 
lie Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 

45 

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816 
Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 

act cga ggg cgt tct gcg gcg teg cgc ccc acc gag cga cct cga gcc 864 
50 Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 

275 280 285 

cca gcc cgc tec get tct cgc ccc aga egg ccc gtc gag ggt acc gag 912 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

55 
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etc gga tec act agt cca gtg tgg tgg aat tct gca gat ate cag cac 960 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp lie Gin His 
305 310 315 320 

5 

agt ggc ggc cgc atg tec aat tta ctg acc gta cac caa aat ttg cct 1008 

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 
325 330 335 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056 

10 Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 

340 345 350 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104 

Asp Met Phe Arg Asp Arg Gin Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

75 

ctt ctg tec gtt tgc egg teg tgg gcg gca tgg tgc aag ttg aat aac 1152 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

20 egg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt eta 1200 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 * 390 395 400 



25 



tat ctt cag gcg cgc ggt ctg gca gta aaa act ate cag caa cat ttg 1248 
Tyr Leu Gin Ala Arg Gly Leu Ala Val Lys Thr lie Gin Gin His Leu 
405 410 415 



ggc cag eta aac atg ctt cat cgt egg tec ggg ctg cca cga cca agt 1296 
Gly Gin Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
30 4 2 0 4 2 5 4 3 0 

gac age aat get gtt tea ctg gtt atg egg egg ate cga aaa gaa aac 1344 
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg lie Arg Lys Glu Asn 
435 440 445 

35 gtt gat gec ggt gaa cgt gca aaa cag get eta gcg ttc gaa cgc act 1392 

Val Asp Ala Gly Glu Arg Ala Lys Gin Ala Leu Ala Phe Glu Arg Thr 
450 455 460 

gat ttc gac cag gtt cgt tea etc atg gaa aat age gat cgc tgc cag 1440 
40 Asp Phe Asp Gin Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gin 

465 470 475 480 

gat ata cgt aat ctg gca ttt ctg ggg att get tat aac acc ctg tta 1488 

Asp lie Arg Asn Leu Ala Phe Leu Gly lie Ala Tyr Asn Thr Leu Leu 
485 490 495 

45 

cgt ata gec gaa att gee agg ate agg gtt aaa gat ate tea cgt act 1536 

Arg lie Ala Glu lie Ala Arg lie Arg Val Lys Asp lie Ser Arg Thr 
500 505 510 

so gac ggt ggg aga atg tta ate cat att ggc aga acg aaa acg ctg gtt 1584 

Asp Gly Gly Arg Met Leu lie His lie Gly Arg Thr Lys Thr Leu Val 
515 520 ~ 525 



55 



age acc gca ggt gta gag aag gca ctt age ctg ggg gta act aaa ctg 1632 
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 
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gtc gag cga tgg att tec gtc tct ggt gta get gat gat ccg aat aac 1680 

Val Glu Arg Trp lie Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 

545 550 555 560 

tac ctg ttt tgc egg gtc aga aaa aat ggt gtt gee gcg cca tct gee 1728 

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 

565 570 575 

w ace age cag eta tea act cgc gee ctg gaa ggg att ttt gaa gca act 1776 

Thr Ser Gin Leu Ser Thr Arg Ala Leu Glu Gly lie Phe Glu Ala Thr 
580 585 590 



15 



20 



25 



cat cga ttg att tac ggc get aag gat gac tct ggt cag aga tac ctg 1824 

His Arg Leu lie Tyr Gly Ala Lys Asp Asp Ser Gly Gin Arg Tyr Leu 
595 600 605 

gee tgg tct gga cac agt gee cgt gtc gga gec gcg cga gat atg gee 1872 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 

cgc get gga gtt tea ata ccg gag ate atg caa get ggt ggc tgg ace 1920 

Arg Ala Gly Val Ser lie Pro Glu lie Met Gin Ala Gly Gly Trp Thr 

625 * 630 635 640 

aat gta aat att gtc atg aac tat ate cgt aac ctg gat agt gaa aca 1968 

Asn Val Asn He Val Met Asn Tyr lie Arg Asn Leu Asp Ser Glu Thr 
645 650 655 



30 ggg gca atg gtg cgc ctg ctg gaa gat ggc gattag 2004 

Gly Ala Met Val Arg Leu Leu Glu Asp Gly 
660 665 

<210> 8 
35 <211> 666 

<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 



40 



45 



50 



55 



<400> 8 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
15 10 15 

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 

Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
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Gly Pro 
Arg Thr 



85 

Ala Arg Ala 
100 

Pro Thr Thr 
115 



90 



95 



Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
105 110 

Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 
120 125 



10 



15 



Thr Lys 
130 

Ser Ala 
145 

Ala Pro 



Ala Pro Ala 
Gin Pro Glu 



Thr Arg Ser 
165 



Ala Pro Ala 
135 

Ser Ala Ala 
150 

Lys Thr Pro 



Ala Glu Thr Thr Arg Gly Arg Lys 
140 

Leu Pro Asp Ala Pro Ala Ser Thr 

155 160 

Ala Gin Gly Leu Ala Arg Lys Leu 
170 175 



20 



His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 

180 185 190 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 

195 200 205 



25 



30 



35 



40 



45 



50 



55 



Ala Ala 
210 

Arg Pro 
225 

lie Arg 



Glu Leu 



Thr Arg 



Pro Ala 
290 

Leu Gly 
305 



Met His Ala 



Arg Thr Asp 



Val Thr Val 
245 



Arg Met Ala 
215 

Glu Asp Leu 
230 

Cys Glu Gly 



Ala Val Gin Leu Trp Asp Met Ser 
220 

Asn Glu Leu Leu Gly lie Thr Thr 
235 240 



Lys Asn 
250 



Leu Leu Gin Arg Ala Asn 
255 



Val Asn 
2 60 

Gly Arg 
275 

Arg Ser 
Ser Thr 



Ser Gly Gly Arg 



Ala Leu 



Asp Met 



Leu Leu 
370 

Arg Lys 
385 



Pro Val 
340 

Phe Arg 
355 

Ser Val 
Trp Phe 



Pro Asp Val 

Ser Ala Ala 

Ala Ser Arg 
295 

Ser Pro Val 
310 

Met Ser Asn 
325 

Asp Ala Thr 
Asp Arg Gin 



Cys Arg Ser 
375 

Pro Ala Glu 
390 



Val Gin Asp 
265 

Ser Arg Pro 
280 

Pro Arg Arg 

Trp Trp Asn 

Leu Leu Thr 
330 

Ser Asp Glu 
345 

Ala Phe Ser 
360 

Trp Ala Ala 
Pro Glu Asp 



Val Asp Ala 



Thr Glu Arg 
285 

Pro Val Glu 
300 

Ser Ala Asp 
315 

Val His Gin 



Val Arg Lys 



Glu His Thr 
365 

Trp Cys Lys 
380 

Val Arg Asp 
395 



Ala Thr Ala 
270 

Pro Arg Ala 
Gly Thr Glu 



lie Gin His 
320 

Asn Leu Pro 
335 

Asn Leu Met 
350 

Trp Lys Met 



Leu Asn Asn 



Tyr Leu Leu 
400 
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Tyr Leu Gin Ala Arg 
405 

Gly Gin Leu Asn Met 
420 

Asp Ser Asn Ala Val 
435 



Gly Leu Ala Val Lys Thr He 
410 

Leu His Arg Arg Ser Gly Leu 
425 

Ser Leu Val Met Arg Arg He 
440 



Gin Gin Mis Leu 
415 

Pro Arg Pro Ser 
430 

Arg Lys Glu Asn 
445 



w 



Val Asp Ala Gly Glu 
450 



Arg Ala Lys Gin Ala Leu Ala 
455 " 460 



Phe Glu Arg Thr 
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Asp Phe Asp Gin Val 
4 65 

Asp He Arg Asn Leu 
485 

Arg He Ala Glu He 
500 

Asp Gly Gly Arg Met 
515 



Arg Ser Leu Met Glu Asn Ser 
470 475 

Ala Phe Leu Gly He Ala Tyr 
490 

Ala Arg He Arg Val Lys Asp 
505 

Leu He His He Gly Arg Thr 
520 



Asp Arg Cys Gin 
480 

Asn Thr Leu Leu 
495 

He Ser Arg Thr 
510 

Lys Thr Leu Val 
525 
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30 



35 



Ser Thr Ala Gly Val 
530 

Val Glu Arg Trp He 
545 

Tyr Leu Phe Cys Arg 
565 

Thr Ser Gin Leu Ser 
580 

His Arg Leu He Tyr 
595 

Ala Trp Ser Gly His 
610 



Glu Lys Ala Leu Ser Leu Gly 
535 540 

Ser Val Ser Gly Val Ala Asp 
550 555 



Val Thr Lys Leu 



Asp Pro Asn Asn 
560 



Val Arg Lys Asn Gly 
570 



Val Ala Ala Pro Ser Ala 
575 



Thr Arg Ala Leu Glu Gly He 
585 

Gly Ala Lys Asp Asp Ser Gly 
600 

Ser Ala Arg Val Gly Ala Ala 
615 620 



Phe Glu Ala Thr 
590 

Gin Arg Tyr Leu 
605 

Arg Asp Met Ala 
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Arg Ala Gly Val Ser 
625 



He Pro Glu He Met Gin Ala 
630 635 



Gly Gly Trp Thr 
640 



45 



Asn Val Asn He Val 
645 

Gly Ala Met Val Arg 
660 



Met Asn Tyr He Arg Asn Leu 
650 

Leu Leu Glu Asp Gly 
665 



Asp Ser Glu Thr 
655 



<210> 9 
50 <211> 33 

<212> DNA 

<213> Human immunodeficiency virus 
<400> 9 

tacggccgca agaagcgccg ccaacgccgc cgc 33 
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10 



<210> 10 
<211> 11 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 10 

Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg 
15 10 



15 



20 



25 



<210> 11 
<211> 42 
<212> DNA 

<213> Human immunodeficiency virus 

<220> 
<221> CDS 
<222> (4) . . (42) 

<400> 11 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc 42 
Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly 
1 5 10 



<210> 12 
<211> 13 
<212> PRT 

30 <213> Human immunodeficiency virus 
<400> 12 

Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly 
15 10 

35 



Claims 

40 1 . Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain 

for preparing an agent for inducing target gene alterations in a living organism, wherein said living organism 
4 5 carries at least one or more recognition sites for said site-specific DNA recombinase integrated in its genome. 

2. The use of claim 1 wherein the site-specific DNA recombinase domain is selected from a recombinase protein 
derived from Cre, Flp, <J>C31 recombinase, and R recombinase and preferably is Cre having amino acids 1 5 to 357 
of SEQ ID NO: 2 or Flpe having amino acids 15 to 437 of SEQ ID NO: 4. 

50 

3. The use of claim 1 or 2 wherein the protein transduction domain is a protein derived from the Antennapedia protein 
of Drosophila, from the VP22 protein of HSV or from the TAT protein of HIV, and preferably is derived from the TAT 
protein. 

55 4. The use of claim 3, wherein the TAT protein comprises the amino acid sequence 
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YGRKKRQRRR (SEQ ID NO: 10). 

5. The use of claims 1 to 4, wherein the protein transduction domain is fused to the N-terminal of the site-specific 
DNA recombinase domain. 

6. The use of claims 1 to 5, wherein the protein transduction domain is fused to the site-specific DNA recombinase 
domain through a direct chemical bond or through a linker molecule. 

7. The use of claim 6, wherein the linker molecule is a short peptide having 1 to 20, preferably 1 to 10 amino acid 
residues. 

8. The use of claims 1 to 7, wherein said fusion protein further comprises additional functional sequences. 

9. The use of claim 1 , wherein the fusion protein has the sequence shown in SEQ ID NOs: 2, 4, 6 or 8. 

10. The use of claims 1 to 8, wherein the living organism is a vertebrate, preferably a rodent or a fish. 

11. A method for inducing gene alterations in a living organism which comprises administering to said living organism, 
a fusion protein comprising a site-specific DNA recombinase domain and a protein transduction domain as defined 
in claims 1 to 9, wherein said living organism carries at least one or more recognition sites for said site-specific 
DNA recombinase integrated in its genome. 

12. A fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain 

provided that when (a) is Flp then (b) is not the VP22 protein of HSV. 

13. The fusion protein of claim 11 being as defined in claims 2 to 9. 

14. The fusion of claim 12 or 13, wherein the protein transduction domain is derived from the TAT protein of HIV. 

15. A DNA sequence coding for the fusion protein of claim 12. 

16. The DNA sequence of claim 15 comprising the sequence shown in SEQ ID NOs: 9 and/or 11 . 

17. A vector comprising the DNA sequence of claim 15. 

18. A host cell transformed with the vector of claim 17 and/or comprising the DNA of claim 15. 

19. A method for producing the fusion protein of claim 11 which comprises culturing the transformed host cell of claim 
17 and isolating the fusion protein. 

20. An injectable composition comprising the fusion protein as defined in claims 1 to 9 or12 to14. 
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Fig* 1 




RSS-flanked recombination Induced mouse 
target gene mutant 
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Use of a fusion protein comprising a site-specific DNA 
recombinase domain and a protein transduction domain for 
preparing an agent for inducing target gene alterations in a 
living organism carrying at least one recognition site In 
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1s the Cre recomblnase and said protein transduction domain 
is derived from Antennapedia protein of Drosophlla; 
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2. Claims: 1-3,5-13,15,17-20 (all partially) 

As for subject 1, but wherein said site-specific recomblnase 
is the Cre recomblnase and said protein transduction domain 
1s derived from VP22 of HSV, eventually as presented in 
SeqIdNo.6 



3. Claims: 1-20 (all partially) 

As for subject 1, but wherein said site-specific recomblnase 
1s the Cre recomblnase and said protein transduction domain 
1s derived from Tat of HIV, eventually as presented 1n 
SeqIdNo.2, eventually comprising the Tat-derived sequences 
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As for subject 1, but wherein said site-specific recomblnase 
is the Flp recombinase or its modified variant Flpe, and 
said protein transduction domain is derived from AntP of 
Drosophi la 
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As for subject 1, but wherein said site-specific recombinase 
is the Flp recombinase or Its modified variant Flpe, and 
said protein transduction domain 1s derived from VP22 of 
HSV, eventually as presented in SeqIdNo.8 



6. Claims: 1-20 (all partially) 

As for subject 1, but wherein said site-specific recombinase 
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eventually as presented in SeqIdNo>4 t eventually comprising 
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As for subject 1, but wherein said site-specific recombinase 
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